Phylogeny Inference of Closely Related Bacterial Genomes: Combining the Features of Both Overlapping Genes and Collinear Genomic Regions
Abstract
Overlapping genes (OGs) are a widespread genomic feature of bacterial genomes and have been used as rare genomic markers for inferring the phylogeny of closely related bacterial species. However, OG-based inference can perform poorly when the genomes analyzed are either too closely or too distantly related. A further drawback of OGs as phylogenetic markers is that they usually take little account of how genomic rearrangements, such as intra-chromosome or intra-genome translocations, horizontal gene transfer, and gene loss, affect the similarity estimate. To explore the effect of such events on the accuracy of phylogeny reconstruction, we combine the phylogenetic signal of OGs with that of collinear genomic regions, here called locally collinear blocks (LCBs). Putting the two together, we refine our previous metric of pairwise similarity between two closely related bacterial genomes. As a case study, we used the new method to reconstruct the phylogeny of 88 Enterobacteriales genomes of the class Gammaproteobacteria. Our results show that the topological accuracy of the inferred phylogeny improved when OGs and LCBs were considered simultaneously, suggesting that combining these two phylogenetic markers may reduce, to some extent, the influence of gene loss on phylogeny inference. We believe such phylogenomic studies will help identify more effective approaches to increasing the robustness of phylogeny reconstruction for closely related bacterial organisms.
Similar resources
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees and Supermatrices
Over 3000 microbial (bacterial and archaeal) genomes have been made publicly available to date, providing an unprecedented opportunity to examine evolutionary genomic trends and offering valuable reference data for a variety of other studies such as metagenomics. The utility of these genome sequences is greatly enhanced when we have an understanding of how they are phylogenetically related to...
Detection of genomic islands via segmental genome heterogeneity
While the recognition of genomic islands can be a powerful mechanism for identifying genes that distinguish related bacteria, few methods have been developed to identify them specifically. Rather, identification of islands often begins with cataloging individual genes likely to have been recently introduced into the genome; regions with many putative alien genes are then examined for other feat...
The Phylogeny of Calligonum and Pteropyrum (Polygonaceae) Based on Nuclear Ribosomal DNA ITS and Chloroplast trnL-F Sequences
This study presents phylogenetic analyses of two woody polygonaceous genera, Calligonum and Pteropyrum, using both chloroplast fragment (trnL-F) and nuclear ribosomal internal transcribed spacer (nrDNA ITS) sequence data. All phylogenies inferred using parsimony and Bayesian methods showed that Calligonum and Pteropyrum are both monophyletic and closely related taxa. They have no affinity w...
Pathophysiologic mechanisms of obesity- and chronic inflammation-related genes in etiology of polycystic ovary syndrome
Objective(s): One of the common heterogeneous reproductive disorders in women of childbearing age is polycystic ovary syndrome (PCOS). It is characterized by lack of fertility due to anovulatory cycles, hyperandrogenemia, polycystic ovaries, hyperinsulinemia, and obesity. Both reproductive anomalies and metabolic disorders are involved in PCOS pathology. Although the r...
Recognizing the pseudogenes in bacterial genomes
Pseudogenes are now known to be a regular feature of bacterial genomes and are found in particularly high numbers within the genomes of recently emerged bacterial pathogens. As most pseudogenes are recognized by sequence alignments, we use newly available genomic sequences to identify the pseudogenes in 11 genomes from 4 bacterial genera, each of which contains at least 1 human pathogen. The nu...